Moving Image Archive > Community Video > DrupalCon SF 2010: How to build a Jobs Aggregation Search Engine with Nutch, Apache Solr and Views 3 in about an hour
Nutch is an open web crawler that lets you do fine grained or Internet wide web crawling. In this session I will introduce you to the Drupal Nutch module, which will help with the setup and control of your crawls. We will combine this with some of the new features in the Apache Solr, Views 3 and Apache Solr views to create hybrid search engine vertical that interleaves your content with supporting web content.
The Agenda will be:
1. An introduction to the Apache Nutch crawler
2. An introduction to the Features of the Drupal Nutch module
3. Technical Design decisions on combining crawled data with your Drupal data in Apache Solr
4. Bringing it all together with a demo of a jobs aggregation search engine
5. Questions
Experience: Advanced, Expert
Industry: education, entertainment, library, media
Write a review
Downloaded 754 times
Reviews
Average Rating:
Reviewer:Whatdoesitwant -
-
April 23, 2010 Subject:
drupalcon 2010
This video is part of the drupalcon 2010 set, recorded at DrupalCon San Francisco 2010.